Abstract: Automatic data summarization is a branch of machine learning and text mining in which a source text is condensed into a shorter version that preserves its information content and overall meaning. First developed as a labour-intensive manual discipline in the 1980s, text mining has become steadily more efficient as computing power has increased. In-A-Nutshell is an attempt to build a robust automated text summarization system based on sentence scoring.
Keywords: text mining, summarization, NLP, extraction, abstraction, cue phrases, sentence generation.
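
To make the idea of sentence scoring concrete, the following is a minimal Python sketch of one common extractive approach: each sentence is scored by the average corpus frequency of its non-stopword terms, and the top-scoring sentences are returned in their original order. This is an illustrative assumption, not the scoring scheme used by In-A-Nutshell; the names `STOPWORDS`, `split_sentences`, `tokenize`, and `summarize` are hypothetical, and the stopword list is deliberately tiny.

```python
import re
from collections import Counter

# Hypothetical, deliberately small stopword list (assumption for illustration).
STOPWORDS = frozenset({
    "a", "an", "and", "are", "as", "in", "is", "it", "of", "on", "the", "to",
})


def split_sentences(text):
    """Split text into sentences on ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p.strip() for p in parts if p.strip()]


def tokenize(sentence):
    """Lowercase a sentence and return its words, minus stopwords."""
    words = re.findall(r"[a-z']+", sentence.lower())
    return [w for w in words if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in document order."""
    sentences = split_sentences(text)
    # Term frequencies over the whole text.
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        # Average term frequency, so long sentences are not favoured
        # merely for containing more words.
        words = tokenize(sentence)
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(
        range(len(sentences)),
        key=lambda i: score(sentences[i]),
        reverse=True,
    )
    keep = sorted(ranked[:max_sentences])  # restore original order
    return " ".join(sentences[i] for i in keep)


if __name__ == "__main__":
    sample = "Cats chase mice. Cats like cats. The weather is nice today."
    print(summarize(sample, max_sentences=1))
```

Averaging rather than summing the term frequencies is one design choice among several; a fuller system would typically combine it with other signals such as sentence position or cue phrases.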